Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 7109 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.8 MiB |
| Average record size in memory | 558.7 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 8 |
| DATE | 2 |
| BOOL | 1 |
Reproduction
| Analysis started | 2022-09-12 09:45:18.235497 |
|---|---|
| Analysis finished | 2022-09-12 09:45:56.889383 |
| Duration | 38.65 seconds |
| Version | pandas-profiling v2.7.1 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
AREA
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| Chrompet | 1702 | 23.9% | |
| Karapakkam | 1366 | 19.2% | |
| KK Nagar | 997 | 14.0% | |
| Velachery | 981 | 13.8% | |
| Anna Nagar | 788 | 11.1% | |
| Adyar | 774 | 10.9% | |
| T Nagar | 501 | 7.0% |
Length
| Max length | 10 |
|---|---|
| Mean length | 8.346884231 |
| Min length | 5 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 15 | 68.2% | |
| Uppercase_Letter | 6 | 27.3% | |
| Space_Separator | 1 | 4.5% |
| Value | Count | Frequency (%) | |
| Latin | 21 | 95.5% | |
| Common | 1 | 4.5% |
| Value | Count | Frequency (%) | |
| ASCII | 22 | 100.0% |
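The AREA percentages and mean length can be recomputed from the counts in the frequency table as a quick consistency check. This is a minimal sketch, assuming that Frequency (%) is the count divided by the number of observations and that Mean length is the count-weighted average of the category name lengths; both definitions are inferred from the numbers, not stated in the report.

```python
# Counts copied from the AREA frequency table of this report.
area_counts = {
    "Chrompet": 1702,
    "Karapakkam": 1366,
    "KK Nagar": 997,
    "Velachery": 981,
    "Anna Nagar": 788,
    "Adyar": 774,
    "T Nagar": 501,
}

# Total number of observations (the report lists 7109).
n_obs = sum(area_counts.values())

# Assumed definition: share of rows per category, rounded to one decimal.
frequency_pct = {
    area: round(100 * count / n_obs, 1) for area, count in area_counts.items()
}

# Assumed definition: string length of each category weighted by its count.
mean_length = sum(len(area) * count for area, count in area_counts.items()) / n_obs

print(n_obs)
print(frequency_pct)
print(mean_length)
```

Under these assumptions the recomputed values agree with the reported ones, which also confirms that the seven categories account for every row.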
INT_SQFT
Real number (ℝ≥0)
| Distinct count | 1699 |
|---|---|
| Unique (%) | 23.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1382.0730060486708 |
|---|---|
| Minimum | 500 |
| Maximum | 2500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 702 |
| Q1 | 993 |
| median | 1373 |
| Q3 | 1744 |
| 95-th percentile | 2084.6 |
| Maximum | 2500 |
| Range | 2000 |
| Interquartile range (IQR) | 751 |
Descriptive statistics
| Standard deviation | 457.4109025 |
|---|---|
| Coefficient of variation (CV) | 0.3309600147 |
| Kurtosis | -0.8863792596 |
| Mean | 1382.073006 |
| Median Absolute Deviation (MAD) | 376 |
| Skewness | 0.1312376308 |
| Sum | 9825157 |
| Variance | 209224.7337 |
| Value | Count | Frequency (%) | |
| 1781 | 18 | 0.3% | |
| 1538 | 15 | 0.2% | |
| 1514 | 13 | 0.2% | |
| 1505 | 13 | 0.2% | |
| 786 | 12 | 0.2% | |
| 961 | 12 | 0.2% | |
| 1081 | 12 | 0.2% | |
| 1634 | 12 | 0.2% | |
| 1655 | 12 | 0.2% | |
| 1622 | 11 | 0.2% | |
| Other values (1689) | 6979 | 98.2% |
| Value | Count | Frequency (%) | |
| 500 | 3 | < 0.1% | |
| 501 | 2 | < 0.1% | |
| 502 | 1 | < 0.1% | |
| 504 | 2 | < 0.1% | |
| 505 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2500 | 1 | < 0.1% | |
| 2499 | 1 | < 0.1% | |
| 2498 | 1 | < 0.1% | |
| 2497 | 1 | < 0.1% | |
| 2496 | 3 | < 0.1% |
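Several of the statistics reported for the variable above follow from a few of the others. The sketch below recomputes them from the reported figures, assuming the usual definitions (range = maximum − minimum, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × number of observations). The variable names are illustrative only.

```python
# Figures copied from the summary tables above (values from 500 to 2500).
n_obs = 7109
mean = 1382.0730060486708
std = 457.4109025
q1, q3 = 993, 1744
minimum, maximum = 500, 2500

value_range = maximum - minimum   # reported Range: 2000
iqr = q3 - q1                     # reported IQR: 751
cv = std / mean                   # reported CV: 0.3309600147
variance = std ** 2               # reported Variance: 209224.7337
total = mean * n_obs              # reported Sum: 9825157

print(value_range, iqr, cv, variance, total)
```

Because the inputs are rounded as printed in the report, the recomputed values match the reported ones only up to rounding.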
DATE_SALE
Date
| Distinct count | 2798 |
|---|---|
| Unique (%) | 39.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Minimum | 2004-01-02 00:00:00 |
|---|---|
| Maximum | 2015-12-02 00:00:00 |
DIST_MAINROAD
Real number (ℝ≥0)
| Distinct count | 201 |
|---|---|
| Unique (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.60317906878605 |
|---|---|
| Minimum | 0 |
| Maximum | 200 |
| Zeros | 33 |
| Zeros (%) | 0.5% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 50 |
| median | 99 |
| Q3 | 148 |
| 95-th percentile | 190 |
| Maximum | 200 |
| Range | 200 |
| Interquartile range (IQR) | 98 |
Descriptive statistics
| Standard deviation | 57.40310959 |
|---|---|
| Coefficient of variation (CV) | 0.5763180465 |
| Kurtosis | -1.165240378 |
| Mean | 99.60317907 |
| Median Absolute Deviation (MAD) | 49 |
| Skewness | 0.01814383556 |
| Sum | 708079 |
| Variance | 3295.11699 |
| Value | Count | Frequency (%) | |
| 39 | 56 | 0.8% | |
| 51 | 53 | 0.7% | |
| 78 | 52 | 0.7% | |
| 77 | 49 | 0.7% | |
| 14 | 48 | 0.7% | |
| 156 | 48 | 0.7% | |
| 73 | 48 | 0.7% | |
| 49 | 47 | 0.7% | |
| 111 | 47 | 0.7% | |
| 190 | 46 | 0.6% | |
| Other values (191) | 6615 | 93.1% |
| Value | Count | Frequency (%) | |
| 0 | 33 | 0.5% | |
| 1 | 28 | 0.4% | |
| 2 | 44 | 0.6% | |
| 3 | 27 | 0.4% | |
| 4 | 46 | 0.6% |
| Value | Count | Frequency (%) | |
| 200 | 38 | 0.5% | |
| 199 | 30 | 0.4% | |
| 198 | 30 | 0.4% | |
| 197 | 38 | 0.5% | |
| 196 | 36 | 0.5% |
N_BEDROOM
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| 1 | 3796 | 53.4% | |
| 2 | 2352 | 33.1% | |
| 3 | 707 | 9.9% | |
| 4 | 254 | 3.6% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
N_BATHROOM
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| 1 | 5594 | 78.7% | |
| 2 | 1515 | 21.3% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 2 | 100.0% |
N_ROOM
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6887044591363063 |
|---|---|
| Minimum | 2 |
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.019098916 |
|---|---|
| Coefficient of variation (CV) | 0.2762755671 |
| Kurtosis | -0.5307863127 |
| Mean | 3.688704459 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1188007656 |
| Sum | 26223 |
| Variance | 1.038562601 |
| Value | Count | Frequency (%) | |
| 4 | 2563 | 36.1% | |
| 3 | 2125 | 29.9% | |
| 5 | 1246 | 17.5% | |
| 2 | 921 | 13.0% | |
| 6 | 254 | 3.6% |
| Value | Count | Frequency (%) | |
| 2 | 921 | 13.0% | |
| 3 | 2125 | 29.9% | |
| 4 | 2563 | 36.1% | |
| 5 | 1246 | 17.5% | |
| 6 | 254 | 3.6% |
| Value | Count | Frequency (%) | |
| 6 | 254 | 3.6% | |
| 5 | 1246 | 17.5% | |
| 4 | 2563 | 36.1% | |
| 3 | 2125 | 29.9% | |
| 2 | 921 | 13.0% |
SALE_COND
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| adj land | 1439 | 20.2% | |
| partial | 1433 | 20.2% | |
| normal sale | 1423 | 20.0% | |
| abnormal | 1411 | 19.8% | |
| family | 1403 | 19.7% |
Length
| Max length | 11 |
|---|---|
| Mean length | 8.004220003 |
| Min length | 6 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 16 | 94.1% | |
| Space_Separator | 1 | 5.9% |
| Value | Count | Frequency (%) | |
| Latin | 16 | 94.1% | |
| Common | 1 | 5.9% |
| Value | Count | Frequency (%) | |
| ASCII | 17 | 100.0% |
PARK_FACIL
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| Yes | 3587 | 50.5% | |
| No | 3522 | 49.5% |
DATE_BUILD
Date
| Distinct count | 5808 |
|---|---|
| Unique (%) | 81.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Minimum | 1949-10-28 00:00:00 |
|---|---|
| Maximum | 2010-12-11 00:00:00 |
BUILDTYPE
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| House | 2444 | 34.4% | |
| Others | 2336 | 32.9% | |
| Commercial | 2329 | 32.8% |
Length
| Max length | 10 |
|---|---|
| Mean length | 6.966661978 |
| Min length | 5 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 12 | 80.0% | |
| Uppercase_Letter | 3 | 20.0% |
| Value | Count | Frequency (%) | |
| Latin | 15 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
UTILITY_AVAIL
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| NoSewr | 3700 | 52.0% | |
| All Pub | 1887 | 26.5% | |
| ELO | 1522 | 21.4% |
Length
| Max length | 7 |
|---|---|
| Mean length | 5.623153749 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 7 | 46.7% | |
| Uppercase_Letter | 7 | 46.7% | |
| Space_Separator | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| Latin | 14 | 93.3% | |
| Common | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
STREET
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| Paved | 2572 | 36.2% | |
| Gravel | 2520 | 35.4% | |
| No Access | 2017 | 28.4% |
Length
| Max length | 9 |
|---|---|
| Mean length | 6.48937966 |
| Min length | 5 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 9 | 64.3% | |
| Uppercase_Letter | 4 | 28.6% | |
| Space_Separator | 1 | 7.1% |
| Value | Count | Frequency (%) | |
| Latin | 13 | 92.9% | |
| Common | 1 | 7.1% |
| Value | Count | Frequency (%) | |
| ASCII | 14 | 100.0% |
MZZONE
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Value | Count | Frequency (%) | |
| RL | 1858 | 26.1% | |
| RH | 1822 | 25.6% | |
| RM | 1817 | 25.6% | |
| C | 550 | 7.7% | |
| A | 537 | 7.6% | |
| I | 525 | 7.4% |
Length
| Max length | 2 |
|---|---|
| Mean length | 1.773245182 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 7 | 100.0% |
| Value | Count | Frequency (%) | |
| Latin | 7 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 7 | 100.0% |
QS_ROOMS
Real number (ℝ≥0)
| Distinct count | 31 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5174708116472075 |
|---|---|
| Minimum | 2.0 |
| Maximum | 5.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2.1 |
| Q1 | 2.7 |
| median | 3.5 |
| Q3 | 4.3 |
| 95-th percentile | 4.9 |
| Maximum | 5 |
| Range | 3 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 0.8919724311 |
|---|---|
| Coefficient of variation (CV) | 0.2535834635 |
| Kurtosis | -1.197535123 |
| Mean | 3.517470812 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | -0.01895704371 |
| Sum | 25005.7 |
| Variance | 0.7956148178 |
| Value | Count | Frequency (%) | |
| 2.5 | 265 | 3.7% | |
| 3.8 | 259 | 3.6% | |
| 3.6 | 255 | 3.6% | |
| 4.6 | 252 | 3.5% | |
| 3.9 | 245 | 3.4% | |
| 4.9 | 242 | 3.4% | |
| 3.4 | 240 | 3.4% | |
| 4.8 | 239 | 3.4% | |
| 4.2 | 239 | 3.4% | |
| 3.3 | 239 | 3.4% | |
| Other values (21) | 4634 | 65.2% |
| Value | Count | Frequency (%) | |
| 2 | 203 | 2.9% | |
| 2.1 | 236 | 3.3% | |
| 2.2 | 213 | 3.0% | |
| 2.3 | 224 | 3.2% | |
| 2.4 | 208 | 2.9% |
| Value | Count | Frequency (%) | |
| 5 | 228 | 3.2% | |
| 4.9 | 242 | 3.4% | |
| 4.8 | 239 | 3.4% | |
| 4.7 | 239 | 3.4% | |
| 4.6 | 252 | 3.5% |
QS_BATHROOM
Real number (ℝ≥0)
| Distinct count | 31 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.507244338162892 |
|---|---|
| Minimum | 2.0 |
| Maximum | 5.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2.1 |
| Q1 | 2.7 |
| median | 3.5 |
| Q3 | 4.3 |
| 95-th percentile | 4.9 |
| Maximum | 5 |
| Range | 3 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 0.8978337054 |
|---|---|
| Coefficient of variation (CV) | 0.2559940565 |
| Kurtosis | -1.21625135 |
| Mean | 3.507244338 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | 0.0003104318578 |
| Sum | 24933 |
| Variance | 0.8061053625 |
| Value | Count | Frequency (%) | |
| 2.7 | 256 | 3.6% | |
| 4.8 | 255 | 3.6% | |
| 3.7 | 251 | 3.5% | |
| 4.7 | 247 | 3.5% | |
| 4.9 | 245 | 3.4% | |
| 3 | 241 | 3.4% | |
| 4.2 | 237 | 3.3% | |
| 3.4 | 234 | 3.3% | |
| 2.2 | 234 | 3.3% | |
| 4.6 | 234 | 3.3% | |
| Other values (21) | 4675 | 65.8% |
| Value | Count | Frequency (%) | |
| 2 | 222 | 3.1% | |
| 2.1 | 224 | 3.2% | |
| 2.2 | 234 | 3.3% | |
| 2.3 | 220 | 3.1% | |
| 2.4 | 230 | 3.2% |
| Value | Count | Frequency (%) | |
| 5 | 219 | 3.1% | |
| 4.9 | 245 | 3.4% | |
| 4.8 | 255 | 3.6% | |
| 4.7 | 247 | 3.5% | |
| 4.6 | 234 | 3.3% |
QS_BEDROOM
Real number (ℝ≥0)
| Distinct count | 31 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.485300323533549 |
|---|---|
| Minimum | 2.0 |
| Maximum | 5.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2.1 |
| Q1 | 2.7 |
| median | 3.5 |
| Q3 | 4.3 |
| 95-th percentile | 4.9 |
| Maximum | 5 |
| Range | 3 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 0.8872664105 |
|---|---|
| Coefficient of variation (CV) | 0.2545738755 |
| Kurtosis | -1.190165265 |
| Mean | 3.485300324 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | 0.01728160906 |
| Sum | 24777 |
| Variance | 0.7872416831 |
| Value | Count | Frequency (%) | |
| 2.6 | 273 | 3.8% | |
| 3.2 | 253 | 3.6% | |
| 4 | 248 | 3.5% | |
| 2.4 | 244 | 3.4% | |
| 3.8 | 244 | 3.4% | |
| 3.1 | 243 | 3.4% | |
| 2.1 | 242 | 3.4% | |
| 3 | 241 | 3.4% | |
| 3.4 | 239 | 3.4% | |
| 4.4 | 237 | 3.3% | |
| Other values (21) | 4645 | 65.3% |
| Value | Count | Frequency (%) | |
| 2 | 221 | 3.1% | |
| 2.1 | 242 | 3.4% | |
| 2.2 | 237 | 3.3% | |
| 2.3 | 200 | 2.8% | |
| 2.4 | 244 | 3.4% |
| Value | Count | Frequency (%) | |
| 5 | 217 | 3.1% | |
| 4.9 | 203 | 2.9% | |
| 4.8 | 211 | 3.0% | |
| 4.7 | 228 | 3.2% | |
| 4.6 | 233 | 3.3% |
QS_OVERALL
Real number (ℝ≥0)
| Distinct count | 480 |
|---|---|
| Unique (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.503253788415239 |
|---|---|
| Minimum | 2.0 |
| Maximum | 4.97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2.63 |
| Q1 | 3.13 |
| median | 3.503253788 |
| Q3 | 3.88 |
| 95-th percentile | 4.37 |
| Maximum | 4.97 |
| Range | 2.97 |
| Interquartile range (IQR) | 0.75 |
Descriptive statistics
| Standard deviation | 0.5254397319 |
|---|---|
| Coefficient of variation (CV) | 0.1499862024 |
| Kurtosis | -0.4725985847 |
| Mean | 3.503253788 |
| Median Absolute Deviation (MAD) | 0.3767462116 |
| Skewness | -0.007287861446 |
| Sum | 24904.63118 |
| Variance | 0.2760869118 |
| Value | Count | Frequency (%) | |
| 3.54 | 59 | 0.8% | |
| 3.26 | 57 | 0.8% | |
| 3.32 | 56 | 0.8% | |
| 3.56 | 55 | 0.8% | |
| 3.36 | 54 | 0.8% | |
| 3.34 | 53 | 0.7% | |
| 3.47 | 51 | 0.7% | |
| 3.2 | 51 | 0.7% | |
| 3.96 | 51 | 0.7% | |
| 3.74 | 50 | 0.7% | |
| Other values (470) | 6572 | 92.4% |
| Value | Count | Frequency (%) | |
| 2 | 1 | < 0.1% | |
| 2.06 | 2 | < 0.1% | |
| 2.09 | 1 | < 0.1% | |
| 2.11 | 1 | < 0.1% | |
| 2.18 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4.97 | 1 | < 0.1% | |
| 4.95 | 1 | < 0.1% | |
| 4.94 | 1 | < 0.1% | |
| 4.93 | 1 | < 0.1% | |
| 4.9 | 1 | < 0.1% |
REG_FEE
Real number (ℝ≥0)
| Distinct count | 7038 |
|---|---|
| Unique (%) | 99.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 376938.33070755383 |
|---|---|
| Minimum | 71177 |
| Maximum | 983922 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 71177 |
|---|---|
| 5-th percentile | 197984.6 |
| Q1 | 272406 |
| median | 349486 |
| Q3 | 451562 |
| 95-th percentile | 669167.4 |
| Maximum | 983922 |
| Range | 912745 |
| Interquartile range (IQR) | 179156 |
Descriptive statistics
| Standard deviation | 143070.662 |
|---|---|
| Coefficient of variation (CV) | 0.3795598652 |
| Kurtosis | 1.126499412 |
| Mean | 376938.3307 |
| Median Absolute Deviation (MAD) | 85998 |
| Skewness | 1.037754561 |
| Sum | 2679654593 |
| Variance | 2.046921433e+10 |
| Value | Count | Frequency (%) | |
| 235229 | 3 | < 0.1% | |
| 348034 | 2 | < 0.1% | |
| 353677 | 2 | < 0.1% | |
| 441717 | 2 | < 0.1% | |
| 330086 | 2 | < 0.1% | |
| 518512 | 2 | < 0.1% | |
| 222526 | 2 | < 0.1% | |
| 424361 | 2 | < 0.1% | |
| 257917 | 2 | < 0.1% | |
| 264914 | 2 | < 0.1% | |
| Other values (7028) | 7088 | 99.7% |
| Value | Count | Frequency (%) | |
| 71177 | 1 | < 0.1% | |
| 95798 | 1 | < 0.1% | |
| 103928 | 1 | < 0.1% | |
| 106466 | 1 | < 0.1% | |
| 111366 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 983922 | 1 | < 0.1% | |
| 981117 | 1 | < 0.1% | |
| 963029 | 1 | < 0.1% | |
| 952411 | 1 | < 0.1% | |
| 947124 | 1 | < 0.1% |
COMMIS
Real number (ℝ≥0)
| Distinct count | 7011 |
|---|---|
| Unique (%) | 98.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 141005.7265438177 |
|---|---|
| Minimum | 5055 |
| Maximum | 495405 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 5055 |
|---|---|
| 5-th percentile | 35990.6 |
| Q1 | 84219 |
| median | 127628 |
| Q3 | 184506 |
| 95-th percentile | 292538 |
| Maximum | 495405 |
| Range | 490350 |
| Interquartile range (IQR) | 100287 |
Descriptive statistics
| Standard deviation | 78768.09372 |
|---|---|
| Coefficient of variation (CV) | 0.558616275 |
| Kurtosis | 1.073363345 |
| Mean | 141005.7265 |
| Median Absolute Deviation (MAD) | 49095 |
| Skewness | 0.9516562165 |
| Sum | 1002409710 |
| Variance | 6204412588 |
| Value | Count | Frequency (%) | |
| 117825 | 3 | < 0.1% | |
| 231426 | 2 | < 0.1% | |
| 95120 | 2 | < 0.1% | |
| 75962 | 2 | < 0.1% | |
| 145973 | 2 | < 0.1% | |
| 92784 | 2 | < 0.1% | |
| 48067 | 2 | < 0.1% | |
| 185864 | 2 | < 0.1% | |
| 97572 | 2 | < 0.1% | |
| 286822 | 2 | < 0.1% | |
| Other values (7001) | 7088 | 99.7% |
| Value | Count | Frequency (%) | |
| 5055 | 1 | < 0.1% | |
| 5126 | 1 | < 0.1% | |
| 5378 | 1 | < 0.1% | |
| 5620 | 1 | < 0.1% | |
| 5943 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 495405 | 1 | < 0.1% | |
| 491961 | 1 | < 0.1% | |
| 485924 | 1 | < 0.1% | |
| 481001 | 1 | < 0.1% | |
| 479297 | 1 | < 0.1% |
SALES_PRICE
Real number (ℝ≥0)
| Distinct count | 7057 |
|---|---|
| Unique (%) | 99.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10894909.63918976 |
|---|---|
| Minimum | 2156875 |
| Maximum | 23667340 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 2156875 |
|---|---|
| 5-th percentile | 5630100 |
| Q1 | 8272100 |
| median | 10335050 |
| Q3 | 12993900 |
| 95-th percentile | 18790428 |
| Maximum | 23667340 |
| Range | 21510465 |
| Interquartile range (IQR) | 4721800 |
Descriptive statistics
| Standard deviation | 3768603.457 |
|---|---|
| Coefficient of variation (CV) | 0.345904976 |
| Kurtosis | 0.5881293416 |
| Mean | 10894909.64 |
| Median Absolute Deviation (MAD) | 2317605 |
| Skewness | 0.7733433359 |
| Sum | 7.745191262e+10 |
| Variance | 1.420237202e+13 |
| Value | Count | Frequency (%) | |
| 9817500 | 2 | < 0.1% | |
| 7195550 | 2 | < 0.1% | |
| 7629750 | 2 | < 0.1% | |
| 8033250 | 2 | < 0.1% | |
| 6519000 | 2 | < 0.1% | |
| 9213320 | 2 | < 0.1% | |
| 7855000 | 2 | < 0.1% | |
| 8191250 | 2 | < 0.1% | |
| 11930880 | 2 | < 0.1% | |
| 9429000 | 2 | < 0.1% | |
| Other values (7047) | 7089 | 99.7% |
| Value | Count | Frequency (%) | |
| 2156875 | 1 | < 0.1% | |
| 2476375 | 1 | < 0.1% | |
| 2640250 | 1 | < 0.1% | |
| 2797250 | 1 | < 0.1% | |
| 2939750 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 23667340 | 1 | < 0.1% | |
| 23407860 | 1 | < 0.1% | |
| 23314580 | 1 | < 0.1% | |
| 23307000 | 1 | < 0.1% | |
| 23247590 | 1 | < 0.1% |
HOUSE_AGE
Real number (ℝ≥0)
| Distinct count | 1652 |
|---|---|
| Unique (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8867.524265016176 |
|---|---|
| Minimum | 1430.0 |
| Maximum | 20368.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 1430 |
|---|---|
| 5-th percentile | 2190 |
| Q1 | 5110 |
| median | 8583 |
| Q3 | 12410 |
| 95-th percentile | 16513 |
| Maximum | 20368 |
| Range | 18938 |
| Interquartile range (IQR) | 7300 |
Descriptive statistics
| Standard deviation | 4506.780646 |
|---|---|
| Coefficient of variation (CV) | 0.5082343743 |
| Kurtosis | -0.8414678596 |
| Mean | 8867.524265 |
| Median Absolute Deviation (MAD) | 3590 |
| Skewness | 0.2663986739 |
| Sum | 63039230 |
| Variance | 20311071.79 |
| Value | Count | Frequency (%) | |
| 2555 | 116 | 1.6% | |
| 1825 | 112 | 1.6% | |
| 7300 | 104 | 1.5% | |
| 6205 | 99 | 1.4% | |
| 5110 | 98 | 1.4% | |
| 2920 | 98 | 1.4% | |
| 4380 | 97 | 1.4% | |
| 6935 | 96 | 1.4% | |
| 5840 | 96 | 1.4% | |
| 5475 | 95 | 1.3% | |
| Other values (1642) | 6098 | 85.8% |
| Value | Count | Frequency (%) | |
| 1430 | 11 | 0.2% | |
| 1431 | 9 | 0.1% | |
| 1432 | 1 | < 0.1% | |
| 1433 | 4 | 0.1% | |
| 1460 | 25 | 0.4% |
| Value | Count | Frequency (%) | |
| 20368 | 1 | < 0.1% | |
| 20282 | 2 | < 0.1% | |
| 20250 | 1 | < 0.1% | |
| 20105 | 1 | < 0.1% | |
| 20075 | 6 | 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
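The calculation described above can be sketched in plain Python. The function name `pearson_r` is illustrative, and the inputs are the INT_SQFT and SALES_PRICE values of the first five rows shown in the First rows table of this report. Five points are far too few to say anything about the full dataset; they only make the example concrete.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


int_sqft = [1004, 1986, 909, 1855, 1226]
sales_price = [7600000, 21717770, 13159200, 9630290, 7406250]

r = pearson_r(int_sqft, sales_price)

# Shifting and rescaling either variable (with a positive scale factor)
# leaves r unchanged, up to floating-point error.
r_rescaled = pearson_r(
    [0.0929 * x + 10 for x in int_sqft],
    [y / 1e6 for y in sales_price],
)

print(r, r_rescaled)
```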
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
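A minimal sketch of this rank-based calculation follows. It assumes tied values receive the average of the ranks they span, which is a common convention but is not specified in the text above; the helper names are illustrative.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# A monotonic but nonlinear relationship: rho is 1 while r stays below 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```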
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
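The pair-counting definition above can be written directly as a short function. This sketch implements exactly that formula (often called tau-a), in which tied pairs count as neither concordant nor discordant; library implementations may apply a tie correction, so their results can differ when ties are present. The function name is illustrative.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # The sign of the product tells whether the pair is ordered the same
        # way in both variables (concordant) or oppositely (discordant).
        s = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Three pairs in total: two concordant and one discordant.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

The double loop is quadratic in the number of observations, which is fine for illustration but slow for a dataset of several thousand rows.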
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for it.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | AREA | INT_SQFT | DATE_SALE | DIST_MAINROAD | N_BEDROOM | N_BATHROOM | N_ROOM | SALE_COND | PARK_FACIL | DATE_BUILD | BUILDTYPE | UTILITY_AVAIL | STREET | MZZONE | QS_ROOMS | QS_BATHROOM | QS_BEDROOM | QS_OVERALL | REG_FEE | COMMIS | SALES_PRICE | HOUSE_AGE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Karapakkam | 1004 | 2011-04-05 | 131 | 1 | 1 | 3 | abnormal | Yes | 1967-05-15 | Commercial | All Pub | Paved | A | 4.0 | 3.9 | 4.9 | 4.330 | 380000 | 144400 | 7600000 | 16031.0 |
| 1 | Anna Nagar | 1986 | 2006-12-19 | 26 | 2 | 1 | 5 | abnormal | No | 1995-12-22 | Commercial | All Pub | Gravel | RH | 4.9 | 4.2 | 2.5 | 3.765 | 760122 | 304049 | 21717770 | 4015.0 |
| 2 | Adyar | 909 | 2012-04-02 | 70 | 1 | 1 | 3 | abnormal | Yes | 1992-09-02 | Commercial | ELO | Gravel | RL | 4.1 | 3.8 | 2.2 | 3.090 | 421094 | 92114 | 13159200 | 7152.0 |
| 3 | Velachery | 1855 | 2010-03-13 | 14 | 3 | 2 | 5 | family | No | 1988-03-18 | Others | NoSewr | Paved | I | 4.7 | 3.9 | 3.6 | 4.010 | 356321 | 77042 | 9630290 | 8030.0 |
| 4 | Karapakkam | 1226 | 2009-05-10 | 84 | 1 | 1 | 3 | abnormal | Yes | 1979-10-13 | Others | All Pub | Gravel | C | 3.0 | 2.5 | 4.1 | 3.290 | 237000 | 74063 | 7406250 | 10802.0 |
| 5 | Chrompet | 1220 | 2014-11-09 | 36 | 2 | 1 | 4 | partial | No | 2009-12-09 | Commercial | NoSewr | No Access | RH | 4.5 | 2.6 | 3.1 | 3.320 | 409027 | 198316 | 12394750 | 1796.0 |
| 6 | Chrompet | 1167 | 2007-05-04 | 137 | 1 | 1 | 3 | partial | No | 1979-12-04 | Others | All Pub | No Access | RL | 3.6 | 2.1 | 2.5 | 2.670 | 263152 | 33955 | 8488790 | 10013.0 |
| 7 | Velachery | 1847 | 2006-03-13 | 176 | 3 | 2 | 5 | family | No | 1996-03-15 | Commercial | All Pub | Gravel | RM | 2.4 | 4.5 | 2.1 | 3.260 | 604809 | 235204 | 16800250 | 3650.0 |
| 8 | Chrompet | 771 | 2011-06-04 | 175 | 1 | 1 | 2 | adj land | No | 1977-04-14 | Others | NoSewr | Paved | RM | 2.9 | 3.7 | 4.0 | 3.550 | 257578 | 33236 | 8308970 | 12469.0 |
| 9 | Velachery | 1635 | 2006-06-22 | 74 | 2 | 1 | 4 | abnormal | No | 1991-06-26 | Others | ELO | No Access | I | 3.1 | 3.1 | 3.3 | 3.160 | 323346 | 121255 | 8083650 | 5475.0 |
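The report does not define HOUSE_AGE. For the rows checked below, its value equals the number of days between DATE_BUILD and DATE_SALE. Treating that as the definition for the whole column is an assumption; the sketch only verifies it on three of the rows shown above.

```python
from datetime import date

# (DATE_SALE, DATE_BUILD, reported HOUSE_AGE) for rows 0, 1 and 3 above.
rows = [
    (date(2011, 4, 5), date(1967, 5, 15), 16031.0),
    (date(2006, 12, 19), date(1995, 12, 22), 4015.0),
    (date(2010, 3, 13), date(1988, 3, 18), 8030.0),
]

# Assumed definition: age in days at the time of sale.
ages = [(sale - build).days for sale, build, _ in rows]

for age, (_, _, reported) in zip(ages, rows):
    print(age, reported, age == reported)
```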
Last rows
| | AREA | INT_SQFT | DATE_SALE | DIST_MAINROAD | N_BEDROOM | N_BATHROOM | N_ROOM | SALE_COND | PARK_FACIL | DATE_BUILD | BUILDTYPE | UTILITY_AVAIL | STREET | MZZONE | QS_ROOMS | QS_BATHROOM | QS_BEDROOM | QS_OVERALL | REG_FEE | COMMIS | SALES_PRICE | HOUSE_AGE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7099 | Adyar | 895 | 2011-05-01 | 197 | 1 | 1 | 3 | adj land | Yes | 1971-01-15 | House | NoSewr | No Access | I | 3.6 | 4.7 | 4.2 | 4.12 | 250641 | 7372 | 7371800 | 14716.0 |
| 7100 | T Nagar | 1733 | 2010-02-24 | 191 | 1 | 1 | 4 | abnormal | Yes | 1985-02-03 | Commercial | NoSewr | No Access | RL | 3.4 | 3.7 | 2.1 | 2.89 | 702058 | 312026 | 19501600 | 9152.0 |
| 7101 | Karapakkam | 666 | 2010-11-05 | 51 | 1 | 1 | 2 | adj land | Yes | 1974-05-20 | Others | ELO | Gravel | I | 3.2 | 4.4 | 2.5 | 3.28 | 273317 | 74541 | 6211750 | 13318.0 |
| 7102 | Karapakkam | 701 | 2010-03-02 | 100 | 1 | 1 | 2 | abnormal | No | 1990-08-02 | House | NoSewr | Gravel | RH | 4.2 | 3.0 | 2.0 | 2.96 | 282175 | 141088 | 5643500 | 7152.0 |
| 7103 | Karapakkam | 1462 | 2010-04-23 | 68 | 2 | 2 | 4 | family | No | 1986-04-29 | Others | NoSewr | Gravel | RM | 2.7 | 3.3 | 3.6 | 3.24 | 356716 | 178358 | 9387250 | 8760.0 |
| 7104 | Karapakkam | 598 | 2011-03-01 | 51 | 1 | 1 | 2 | adj land | No | 1962-01-15 | Others | ELO | No Access | RM | 3.0 | 2.2 | 2.4 | 2.52 | 208767 | 107060 | 5353000 | 17942.0 |
| 7105 | Velachery | 1897 | 2004-08-04 | 52 | 3 | 2 | 5 | family | Yes | 1995-11-04 | Others | NoSewr | No Access | RH | 3.6 | 4.5 | 3.3 | 3.92 | 346191 | 205551 | 10818480 | 3196.0 |
| 7106 | Velachery | 1614 | 2006-08-25 | 152 | 2 | 1 | 4 | normal sale | No | 1978-01-09 | House | NoSewr | Gravel | I | 4.3 | 4.2 | 2.9 | 3.84 | 317354 | 167028 | 8351410 | 10455.0 |
| 7107 | Karapakkam | 787 | 2009-03-08 | 40 | 1 | 1 | 2 | partial | Yes | 1977-11-08 | Commercial | ELO | Paved | RL | 4.6 | 3.8 | 4.1 | 4.16 | 425350 | 119098 | 8507000 | 11443.0 |
| 7108 | Velachery | 1896 | 2005-07-13 | 156 | 3 | 2 | 5 | partial | Yes | 1961-07-24 | Others | ELO | Paved | I | 3.1 | 3.5 | 4.3 | 3.64 | 349177 | 79812 | 9976480 | 16060.0 |